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Abstract 

We determine the Mobius function of the poset of compositions of an integer. In 
fact we give two proofs of this formula, one using an involution and one involving 
discrete Morse theory. The composition poset turns out to be intimately connected 
with subword order, whose Mobius function was determined by Bjorner. We show that 
using a generalization of subword order, we can obtain both Bjorner's results and our 
own as special cases. 



If A is any set then the corresponding Kleene closure or free monoid, A* is the set of 
words with letters from A, i.e., 

A* = {w = w(l)w(2) . . . w(n) | n > and w(i) G A for all i}. 

We denote the length (number of elements) of w by \w\. 

Letting P denote the positive integers, we see that P* is the set of integer compositions 
(ordered partitions). We can turn P* into a partially ordered set by letting u < w if there 

*This work was partially done while the author was on leave at DIMACS 

t Partially supported by an award from DIMACS and an NSF VIGRE grant to the Rutgers University 
Department of Mathematics. 
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is a subword w{ix)w{%2) ■ ■ ■ w(ii) of w having length I = \u\ such that u(j) < w(ij) for 
1 < i < I- For example, 433 < 16243 because of the subword 643. Bergeron, Bousquet- 
Melou and Dulucq [2] were the first to study P*, enumerating saturated chains that begin 
at its minimal element. Snellman has also studied saturated chains in this poset as well as 
two other partial orders on P* [20, 21]. One of the main results of this paper is a formula 
for the Mobius function of P* . 

This order on P* is closely related to subword order. If A is any set then the subword 
order on A* is defined by letting u < w if w contains a subsequence w(ii), wfa), . . . , w{i{) 
such that u(j) = w(ij) for 1 < j < I — \u\. By way of illustration, if A = {a, b} then 
abba < aabbbaba because w(l)w(3)w(5)w(8) = abba. Note that we use the notation A* when 
referring to subword order as opposed to the partial order on P*, even though we use < 
for both. We will always give enough context to make it clear which poset we are dealing 
with. Bjorner [4] was the first to completely determine the Mobius function of subword 
order, although special cases had been obtained previously by Farmer [12] and Viennot [23]. 
In fact, Bjorner gave two proofs of his formula, one using an involution [5] and one using 
shellability [4]. He also gave a demonstration with Reutenauer [6] via generating functions on 
monoids. Another proof was given by Warnke [25] using induction while Wang and Ma [24] 
used Cohen-Macaulayness to investigate \i. We derive the Mobius function formula for P* 
using first combinatorial and then topological techniques. 

The rest of this paper is structured as follows. In the next section we review Bjorner 's 
result for subword order as well as the related definitions which will be useful for P*. It 
contains a statement of our formula for the Mobius function of P* in Theorem 1.2. Section 2 
is devoted to giving a proof of this theorem using a sign-reversing involution. 

Although intervals in A* are shellable, those in P* need not even be connected as evident 
from the example in Figure 2, so we need a more powerful tool to study the topology of 
the composition poset. For this we turn to discrete Morse theory, which was developed by 
Forman [13, 14] and can be used to compute the homology of any CW-complex. A method 
for applying this theory to the order complex of a poset was given by Babson and Hersh [1] 
and further studied by Hersh herself [15]. Since this is a relatively recent addition to the 
combinatorial toolbox, we provide an exposition of the basic ideas of the theory in Section 3. 
The subsequent section gives a Morse theoretic proof of Theorem 1.2. 

The similarity between the formulas for the Mobius functions of A* and P* leads one to 
ask if there is a common generalization. In fact, if P is any poset then there is a partial order 
on P* which we call generalized subword order. It has been used in the context of well-quasi- 
ordering; see Kruskal's article [17] for a survey of the early literature. When P is a chain or 
an antichain in the present context, one recovers our results or Bjorner's, respectively. This 
construction is studied in Section 5. Finally, we end with a section of comments and open 
problems. 

1 Subword and composition order 

We now review Bjorner's formula for the Mobius function of subword order, reformulating 
it slightly so as to emphasize the connection to the composition order which is our main 
objective. We assume the reader is familiar with Mobius functions, but all the necessary 
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definitions and theorems we use here can be found in Stanley's text [22, §3.6-3.7]. 

We first need to restate the definition of the partial order in A* in a way that, although 
slightly more complicated, has a direct connection with the Mobius function. Suppose we 
have a distinguished symbol, 0, and suppose that G" A. Then a word r\ = 77(1)77(2) . . . r)(n) G 
(A U 0)* has support 

Supp7/ = {i I Tj(i) 7^ 0}. 

An expansion of u G (AUO)* is a word r? n G (AUO)* such that the restrictions of u and 77 u to 
their supports are equal. For example, if u = abba then one expansion of u is rj u = aObObOOa. 
An embedding of u into w is an expansion rj uw of it which has length \w\ and satisfies 

tywuOO = w(i) for all i G Suppr^™. 

Note that u < w in A* if and only if there is an embedding of u into w. The example 
expansion n uw given above is exactly the embedding which corresponds to the subword of 
w = aabbbaba given at the beginning of the third paragraph of this paper. If w is clear from 
context we simply write r) u for i] uw . 

The Mobius function of A* counts certain types of embeddings. If a G A then a run of 
a's in w is a maximal interval of indices [r, rj] such that 

it;(r) = w(r + 1) = . . . = w(t) = a. 

Continuing our example, the runs in w = aabbbaba are [1,2], [3,5], [6,6], [7,7], and [8,8]. 
Call an embedding n u into w normal if, for every a G A and every run [r, t] of a's, we have 

(r, t] C Supp ?] n 

where (r, £] denotes the half-open interval. In our running example, this means that the 
second a and the fourth and fifth 6's must be in any normal embedding. (If r = t then 
(r, t] = 0, so there is no restriction on runs of one element.) Thus in this case there are 
precisely two normal embeddings of u into w, namely 

r) u = OaObbaOO and OaObbOOa. 

Define ( w ) to be the number of normal embeddings of u into w. 
We now have everything in place to state Bjorner's result. 



Theorem 1.1 (Bjorner [4]). Ifu,w G A* then 



fi(u,w) = (-l)M-M(^J . □ 

Finishing our example, we see that 

fi{abba, aabbbaba) = (-1) 8 " 4 -2 = 2. 

We now turn to P*. The definitions of support and expansion are exactly as before, but 
the notion of embedding must be updated to reflect the different partial order. To this end 
we define an embedding of u into w as an expansion 77^ of u having length \w\ such that 

Vu{i) < w(i) for 1 < i < \w\. 
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Again, u < w in P* is equivalent to the existence of an embedding of u into w. An interval 
in P* is displayed in Figure 2. 

If u < w and rj w is an expansion of w then there is a unique last or rightmost embedding 
p uw of u into rj w which has the property that for any other embedding rj uw of u into rj w one 
has Supp(77 ut „) < Supp(p uw ). (If S — {i\ < ■ ■ ■ < i m } and S' = {i[ < ■ ■ ■ < i' m } then we 
write S < S' to mean that ij < for 1 < j < m.) Note that p uw depends on r] w , not just 
on w, but the expansion of w used will always be clear from context. 

Like subword order, we define normal embeddings for P* in terms of runs (defined in the 
same way in this context). We say that an embedding r] u into w is normal if the following 
conditions hold. 

1. For 1 < i < \w\ we have r] u {i) = w(i), w{i) — 1, or 0. 

2. For all k > 1 and every run [r, t] of /c's in w, we have 

(a) (r, t] C Supp?7 u if k = 1, 

(b) r G Supp r/ u if A; > 2. 

Comparing this with Bjorner's definition, we see that in P* a normal embedding can have 
three possible values at each position instead of two. Also, the run condition for ones is the 
same as in A*, while that condition for integers greater than one is complementary. As an 
example, if w = 2211133 and u = 21113, then there are two normal embeddings, namely 
7] u = 2101130 and 2011130. Note that 2001113 and 0211130 are not normal since they violate 
conditions (1) and (2), respectively. 

The sign of a normal embedding depends on the embedding itself and not just the length 
of the compositions. Given a normal embedding r] u into w we define its defect to be 

d(r]u) = I Vu(i) = w(i) - 1}. 
We can now state our main theorem about P*. 
Theorem 1.2. Ifu,w G P* then 

where the sum is over all normal embeddings rj u into w. 

In the example of the previous paragraph, this gives 

^(21113, 2211133) = (-1) 2 + (-1)° = 2. 

Although this example does not show it, it is possible to have cancellation among the terms 
in the sum for p. 
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2 Proof by sign-reversing involution 



We now prove Theorem 1.2 using a sign- reversing involution. The proof is similar in nature 
to Bjorner's proof in [5], but is significantly more complicated. 

Proof (of Theorem 1.2). If u — w then there is exactly one normal embedding rj ww and it 
has defect 0. This gives (—1)° = 1 = p(w,w), as desired. 

Now assume that u < w = w(l) ■ ■ -w(n). Since the Mobius recurrence uniquely defines 
/i, it suffices to show that 

where the inner sum is over all normal embeddings of v into w. We prove this by constructing 
a sign-reversing involution on the set of normal embeddings rj vw for v G [u, w}. 
Let p uv denote the rightmost embedding of u into rj vw , so for all i G [1,77.], 

Puv{i) < r]vw(i) < (1) 

Also let s denote the left-most index where p uv and w differ, i.e., s = mm{i : p uv {i) ^ w(i)}. 
Since u < w, s must exist, and by the definition of s we have 

Puvii) = Vvw{i) = w(i) for all % G [1, s). (2) 

Set k = w(s), so r] vw (s) is k, k — 1, or by normality and p uv (s) < k — 1. Finally, let [r,t] 
denote the indices of the run of fc's in w that contains the index s. 

Our involution maps r] vw to the embedding rfy W = /^^(l) • • • rfy W (n) where rf^ w {i) = r] vw {i) 
for all i s and rjv W (s) is determined by the following rules. If k — 1 then 

1 if^(,) = 0. (dj 

If k > 2, s > r, and p U v{s) = then 

_ r ^ _ / if Vvw(s) = k- i, 

I if^( s ) = o. l4J 

Finally, if A; > 2 and either s = r or p uv (s) ^ then 

/ \ _ J ^ — 1 if Vvw( s ) = k, 

~ \ fc if ^(s) = k-l. { } 

It is not obvious that this map is defined for all normal embeddings r/ vw when k > 2. For 
example, if s > r and p uv (s) = 0, then we should apply (4), but it is a priori possible that 
Vvw( s ) — k, in which case (4) is not defined. However, since s > r, p uv (s — 1) = w(s — 1) = k 
by (2), and this contradicts our choice of p uv as the rightmost embedding of u into rj vw . 
Similar issues arise when s = r or p uv {s) ^ 0: if s = r then rj vw (s) ^ by normality, and if 
p™0) 7^ then 7/„ w (s) 7^ by (1). 

Having established that this map is indeed defined on all normal embeddings, we have 
several properties to prove. First, it is evident from (3), (4), and (5) that the number 
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of elements equal to k — 1 changes by exactly one in passing from r\ vw to r^ Wl and thus 
(— 1)^™ = —( — l) Vvw . We must now prove that rjy W is a normal embedding of v into w for 
some U G [it, it)] and that this map is an involution. 

We begin by showing that rfy W is an embedding of some v G [u,w] into w. It follows 
from the fact that rj vw is an embedding into w and the definition of our map that % w is an 
embedding of some word v into w, so we only need to show that v > u. We prove this by 
showing that 

Vvw(i) > Puv{i) (6) 

for all i G [l,n]. This is clear for all i ^ s because for these indices r)y W (i) = r] vw (i) > p uv (i). 
Furthermore, p uv {s) < k — 1 by the definition of s, so the only case in which (6) is not 
immediate is when k > 2 and rfjj w (s) = 0. However, this can only occur from using (4), 
which requires that p uv (s) = 0, completing the demonstration that v G [u, w\. 

We now aim to show that r^ w is a normal embedding. If rfy W (s) > then Supp (??„„,) ^ 
Supply,), so the normality of tj^w follows from the normality of r] vw and the fact that rfy W (s) 
is either k — 1 or k. If % w (s) = then there are two cases depending on whether (3) or 
(4) was applied. Suppose first that (3) was applied, so k = 1. Comparing (3) with the 
definition of normality, we see that it suffices to show s — r. Suppose to the contrary that 
s G (r, £]. Since s is the left-most position at which p uv and w differ, we have = and 

p uv {s — 1) = 1. However, this contradicts our choice of p uv as the rightmost embedding of u 
in rj vw . Now suppose that (4) was applied, so s > r and p uv {s) = 0. Then (2) implies that 
Vvw(r) = r] vw {r) = k, and normality is preserved. 

It only remains to show that this map is an involution. Consider applying the map to 
r^w In this process we define p U y to be the rightmost embedding of u into rjy W , s = min{i : 
Puv(i) 7^ w (i)}, an d k = w(s). We then follow the rules (3), (4), and (5) to construct an 
embedding rj= w , which we would like to show is equal to r) vw . 

First we claim that p u ^ = p uv . Suppose to the contrary that p ulj ^ p uv . By (6), rfcwii) > 
Puv(i) for all i, so p uv also gives an embedding of u into v, and thus the only way we can 
have p ul j 7^ p uv is if p uv - is further to the right than p uv . This requires that 

= puv(s) < puu(s) < k (7) 

and that 

T)vw{s) < 7}y W (s). (8) 

Because we are assuming that py W is further to the right than p vw , there is some position 
to the left of s at which p uv is nonzero. In fact, (2) shows that we must have p uv {s — 1)^0 
and also implies that 

Vvw(s) < p ul j(s) = p uv (s - 1) = w(s - 1). (9) 

We now consider the three cases arising from each of the rules (3), (4), and (5) in turn. 

Suppose (3) was applied so k — 1. It follows from (8) and the definition of our map that 
Vvw ( s ) = 0. Also (7) and (9) give w(s — 1) = p U v(s) = 1. But then r] vw zeroed out a 1 which 
was not the first in its run, contradicting normality. 

Now suppose (4) was applied. Then by (8) we have rpu W (s) = k — 1. We also have s > r 
which in conjunction with (9) gives p U v(s) = w(s — l) = k. This implies that rjy W (s) < p U y(s), 
but that contradicts the v version of (1). 
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Figure 1: A Morse function on a CW-complex 



Finally suppose that (5) was used. By (7) it must be the case that s = r. Also, 
equation (8) gives r] vw (s) = k — 1 and ffu w (s) = k. Now applying (7) and (9) we have 
k — 1 < w(s — 1) = p U v{s) < k, so w(s — 1) = k which contradicts that fact that s = r. 

Now that we have established the equality of p uv and p uV , the fact that this map is an 
involution can be readily observed. We must have s = s, so k = k, and thus we apply the 
same rule to go from rjv W to rj= w as we applied to get r^ w from 77^, and each of these rules 
is clearly an involution. □ 

3 Introduction to discrete Morse theory 

In this section we review the basic ideas behind Forman's discrete Morse theory [13, 14] 
as well as Babson and Hersh's method for applying the theory to the order complex of a 
poset [1]. 

Let A be a CW-complex. Since we will be working in reduced homology, we assume that 
X has an empty cell of dimension —1 which is contained in every cell of A. If a is a d-ce\\ 
(cell of dimension d) in A then let a 9 be the set of (d — l)-cells r which are contained in the 
closure a. Dually, let a 8 denote the set of (d + l)-cells r such that a C r. 

A real- valued function / on the cells of A is a Morse function if it satisfies the following 
two conditions. 

1. For every cell a of A we have 

(a) #{r G a 9 | /(r) > f(a)} < 1, and 

(b) #{r G a s I f(r) < f(a)} < 1. 

2. If r G a s and /(r) < f(o~) then a is a regular face of r. 

Intuitively the first condition says that, with only certain exceptions, / increases with di- 
mension. In fact, f(o~) = dimer is a perfectly good Morse function on X, although we will 
see shortly that it is not very interesting. A simple example of a Morse function on a CW- 
complex is given in Figure 1 where the value of / is given next to each cell a and we also set 

/(0) = -1. 
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The fact that condition (1) holds for every cell implies that, in fact, at most one of the 
two sets under consideration has cardinality equal to 1. Thus the function / induces a Morse 
matching between pairs of cells a, r with a E t 9 and f(cr) > /(r). The regularity condition 
ensures that for each such pair there is an elementary collapse of r onto f-(rUff). The 
cells which are not matched by / are called critical. Since each collapse is a homotopy 
equivalence, X can be collapsed onto a homotopic complex X* built from the critical cells. 
In our example, the cells labeled 1 and 2 are matched and after collapsing we clearly have 
a complex which is still homotopically a circle. Note that if we take / to be the dimension 
function then every cell is critical and X* = X, so the cell complex does not simplify in this 
case which does not help in understanding its structure. 

Let rhd be the number of critical d-cells of X and let bd be the <i-th reduced Betti 
number over the integers. We also use x(X) for the reduced Euler characteristic. From the 
considerations in the previous paragraph, we have the following Morse inequalities which 
are analogous to those in traditional Morse theory. 

Theorem 3.1 (Forman [14]). For any Morse function on a cell complex X we have 
1. bd < rhd for d > —1, and 



(One can get further inequalities relating various partial alternating sums of the bd and 
rhd-) Continuing our example, we see that m_i = rh = rh\ = 1 which bound 6_i = b = 
and b\ = 1, as well as x(X) = — 1 + 1 — 1 = — 1, as expected. 

We now turn to the special case of order complexes. Let P be a poset and consider 
an open interval (u, w) in P. The corresponding order complex A(u,w) is the abstract 
simplicial complex whose simplices (faces) are the chains in (u,w). We are interested in the 
order complex because of the fundamental fact [19] that 



Therefore finding a Morse function for A (it, w) could permit us to derive the correspond- 
ing Mobius value as well as give extra information about its Betti numbers. Suppose we 
have an ordering of the maximal chains of (u, w) (facets of A(u, w)), say C\, C2, . . . , C\. Call 
a face (subchain) a of Ck new if it is not contained in any Cj for j < k. We would like to 
construct a Morse matching inductively, where at the kth stage we extend the matching on 
the faces in Cj for j < k by matching up as many of the new faces a in Ck as possible. It 
turns out that under fairly mild conditions on the facet ordering, one can construct such a 
matching so that all the new faces in Ck are matched if there are an even number of them, 
and only one is left unmatched if the number is odd. Thus adding each facet contributes 
at most one critical cell. A maximal chain contributing a critical cell is called a critical 
chain. In reading the details of this construction, the reader may find it useful to refer to the 
example of the interval [322, 3322] C P* given in Figure 2. Note that by abuse of notation 
we include u and w when writing out a maximal chain C, even though C is really a subset 
of the open interval (u, w). Also, because of the way our chain order is constructed, we start 




□ 



d>-\ 



p(u,w) = x(A(w,w)). 



(10) 
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3322 




2322 3222 3312 3321 




1322 3122 3212 3221 332 




322 



d : 3322 1 2322 _J_ 1322 1 0322 

C 2 : 3322 2 (3222 _JL_ 3122) 2 3022 

C 3 : 3399 2 3999 3 ( ggjj) 3 39D9 

C 4 : 3322 2 3222 4 (3221) 4 3220 

C 5 : 3322 _ 3 _ (33T2) _ 2 _ 3212 _L_ 3202 

C 6 : 3399 3 3312 3 ( 33Q2) 2 C.9Q9 

C 7 : 3322 4 3321 2 ( 3221) 4 3220 

C 8 : 3399 4 (3321)^^(3320) 2 3990 



Figure 2: The interval [322, 3322] and its maximal chains 

with the top element w and work down to u which is dual to what is done normally. Thus 
in a chain C, terms like "first" and "last" refer to this ordering of C's elements. Finally, we 
list the elements of a chain as embeddings into w for reasons which will become apparent 
when we also describe the labels given to the edges (covers) of a chain. 

To define the types of chain orderings we consider, suppose we have two chains C : w = 

v Vi . . . u and C : w = v' Q v[ . . . u where x y means that 

x covers y. Then we say that C and C agree to index j if Vi = v[ for i < j. In addition, 
C and C diverge from index j if they agree to index j and Vj + i ^ v j+i- m addition, we 
use the notation C < C to mean that C comes before C in the order under consideration. 
An ordering of the maximal chains of [u, w] is a poset lexicographic order, or PL-order for 
short, if it satisfies the following condition. Suppose C and C diverge from index j with 
C < C. Then for any maximal chains D' and D which agree to index j + 1 with C and 
C, respectively, we must have D' < D. Note that orderings coming from the EL-labelings 
introduced by Bjorner [3] or from the more general CL-labelings of Bjorner and Wachs [8] 
are PL-orders as long as one breaks ties among labels consistently. 

The PL-order we use in P* is as follows. If x y is a cover then y is obtained from x 

by reducing a single part of x by 1. Thus there is a unique normal embedding of y into x, 
since if a 1 is reduced to then it must be the first element in the run of ones to which it 
belongs. Similarly, for any expansion rj x there is a unique normal embedding of y into rj x . 

Now given any chain C : w = vq v\ . . . u we inductively associate with each Vj 

an embedding rj Vj into w where r/ Vo = w and, for j > 0, rj Vj+1 is the unique normal embedding 
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of Vj+i into rj Vj . We label the edge Vj Vj+\ of C with the index i of the position which 

was decreased in passing from rj v . to r] Vj+1 . Furthermore, we often write rj v . in place of Vj 
when listing the elements of C. Figure 2 illustrates this labeling. It is important to note 
that although rj v +1 is normal in rj v ., it need not be normal in w. We should also remark that 
this labeling is similar to the one used by Bjorner [4] in his CL-shelling of the intervals in 
subword order. Finally, if one orders the chains of [u, w] using ordinary lexicographic order 
on their label sequences, then the result is a PL-order. This is due to the fact that if two 
chains agree to index j then their first j labels are the same. The chains in Figure 2 are 
listed in PL-order. 

We no return to the general exposition. To construct our matching, when we come to a 

chain C : w = vq V\ . . . u in a given order we must be able to determine which 

faces of C are new. Denote the open interval / from Vi to Vj in C by 

/ = C(Vi, Vj) = V i+1 V i+2 Vj-i. 

(Do not to confuse this with an open interval in the poset.) Then / is a skipped interval if 
C — I C C for some C < C. It is a minimal skipped interval or MSI if it does not strictly 
contain another skipped interval. In Figure 2, the MSI's are circled. One can find the MSI's 
by taking the maximal intervals in C — (C H C) for each C < C and then throwing out any 
that are not containment minimal in C . Let I = 1(C) be the set of MSI's in C . Then it is 
easy to check that a face a is new in C if and only if a has a nonempty intersection with 
every I G 1(C). 

The set X in not quite sufficient to construct the matching because the MSI's can overlap 
and we will need disjoint intervals. However, there are no containments among the intervals 
in X, so they can be ordered I±, I2, 13, ■ ■ ■ according to when they are first encountered on 
C. We now inductively construct a set J = 3(C) of J -intervals as follows. Let J\ — I\. 
Then consider the intervals I' 2 = I2 — Ji,I's = h — Ji, ■ ■ ■', throw out any which are not 
minimal; and pick the first one which remains to be J2. Continue this process until there 
are no nonempty modified MSI's left. It happens that in all the critical chains for P*, the 
intervals in X will already be disjoint and so we will not need this step. 

We are finally in a position to describe the matching. List the maximal chains of [u, w] 
using a PL-order. A family K, of intervals of a maximal chain C covers the chain if C = U/C. 
There are three cases depending on whether 1(C) or J(C) covers C or not. First suppose 
that 1(C) does not cover C, so neither does J7"(C), and pick xq to be the first vertex in C— UX. 
Consider the map a — > a A{xq} where A is symmetric difference (not the order complex). 
One can show that this map is a fixed-point free involution on the new faces a in C which 
extends the Morse matching already constructed from the previous chains. Now suppose 
that J = { Ji, J 2 , . . . , J r } does cover C and consider the new face o~c = {xi, %2, ■ ■ ■ , x r } 
where X{ is the first element of Ji for 1 < i < r. Given any other new face a 7^ ac, we find 
the interval Ji of smallest index where a R Ji ^ {xi} and map a — > aA{xi}. This involution 
pairs up all new faces in C except ac, which is critical. Finally, suppose that X covers C 
but J does not. Then we use the mapping of the second case to pair up all new faces whose 
restriction to U.7 is different from ac- We also pair up the remaining new faces (including 
ac) by using the mapping of the first case where we take Xq to be the first vertex in C — VJJ. 
Thus we have outlined the proof of the following theorem, remembering that the dimension 
of a simplex is one less than its number of vertices. 
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Theorem 3.2 ([1]). Let P be a poset and [u,w] be a finite interval in P. For any PL- 
order on the maximal chains of [u, w], the above construction produces a Morse matching in 
A(u,w) with the following properties. 

1. The maximal chain C is critical if and only if J covers C . 

2. If C is critical then its unique critical cell has dimension j^J(C) — 1. □ 

4 A Morse theory derivation of ll 

We are now ready to find the critical cells for the PL-order in P* defined previously. We first 
need three lemmas which will prove useful in a number of cases. Unless otherwise specified, 
we always use the notation 

C : 7/) = y h Vl h h l j_v d = u (11) 

for labeled maximal chains, or 

C : w = rj vo _A_ rj Vl Ja_ r] V2 J2 1 ±_ r] u (12) 

if we wish to be specific about the embeddings determined by C. We also use 

l(C) = (h,l 2 ,...,l d ) 

for its label sequence. 

Take an interval [u,w] C P* with \u\ = \w\ and let m, = w(i) — u(i). Now consider 
the multiset M uw = {{l mi ,2 m2 , . . .}} where i mi means that i is repeated mi times. Then 
every permutation of M is the label sequence for a unique maximal chain in [u, w] and this 
accounts for all the chains. (In fact, [u, v] is isomorphic to the poset of sub multisets of M uw .) 
We record this simple observation for later reference. 

Lemma 4.1 (Same Length Lemma). If \u\ = \w\ then the the label function I gives a 
bijection between the maximal chains in [u,w] and the permutations of M uw . In particular, 
if M uw contains only one distinct element (possibly with multiplicity) then [u,w] contains a 
unique maximal chain. □ 

If jit | < then we no longer have the nice bijection of the previous paragraph, but 
we can still say something. Let C be a maximal chain as in (11) and let I' = . . . , l' d ) 
be any permutation of the label sequence 1(C). Then I' defines a sequence of expansions 
T) v ' o , r} v ' v . . . , rj v ' d where r) v > Q = w and for j > 1 we get r) v >. from r\ v >, ^ by subtracting one from 

position l'j in Vv' j _ 1 - It is still true that C : w = v' v[ . . . v' d = u is a 

maximal chain in [u, w]. We call C the chain specified by I'. Since n v i may not be a normal 
embedding in 1)^ , we may not have 1(C) = I'. Still, at the first place where 1(C) and I' 
differ, that difference must have been caused because using the label in I' would have resulted 
in changing a 1 to a where that 1 was not the first in its run. Thus the corresponding 
normal embedding in 1(C) uses the first 1 in that run which is to the left. Hence 1(C) < V 
in lexicographic order. We summarize this discussion in the following lemma. 
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Lemma 4.2 (Chain Specification Lemma). If C is a maximal chain in [u,w] and I' is 
any permutation of 1(C) then 1(C) < I' where C is the chain specified by V . □ 

As our first application of the Chain Specification Lemma, we can determine what hap- 
pens at descents. A descent of C is Vj G C such that lj > An ascent is defined by 
reversing the inequality. 

Lemma 4.3 (Descent Lemma). If Vj is a descent of C then it is an MSI. 

Proof. Let /' be the permutation of 1(C) gotten by interchanging lj and lj + i and let C be 
the chain specified by I'. Then by Lemma 4.2 we have 1(C) <l'< 1(C), so C comes before 
C in PL-order and it is easy to check that C diverges from C at and rejoins C at Vj + \. 
Thus {vj} is a skipped interval; and since the interval contains only one element it must also 
be minimal. □ 

We only need a few more definitions to state our result characterizing the critical chains. 
A chain C will be said to have a certain property, e.g., weakly decreasing, if 1(C) has that 
property. Also, if r\ u is a normal embedding into w then we need to keep track of the zero 
positions which did not come from decreasing a 1 in w by letting 

D(Vu) = #{* I ?/u(i) = 0, w(i) > 2}. 
Theorem 4.4. Consider the maximal chains in [u, w] C P* in the given PL-order. 

1. There is a bisection between critical chains C and normal embeddings t] u into w where 
the chain corresponding to t] u is the unique weakly decreasing chain ending at rj u . 

2. If C is critical and ends at rj u then 1(C) = 3(C) and 

#l(C)=d( Vu ) + 2D(r ]u )-l. 

We shall prove this theorem by considering 3 cases: when C is weakly decreasing and 
ends at a normal embedding, when C is weakly decreasing and does not end at a normal 
embedding, and when C is not weakly decreasing. Note that for any embedding r) u into w, 
there is at most one decreasing chain ending at rj u , and that if n u is normal then such a chain 
will exist because it will be possible to make each cover normal. Thus there is a bijection 
between normal embeddings and weakly decreasing chains ending at them, but we need to 
show such chains are critical. To do so, we define a plateau of C to be an interval C(vi, Vj) 
such that (h+i, k+2, ■ ■ ■ , lj) is a run of length at least 2 in 1(C). 

Proposition 4.5. If C is weakly decreasing and ends at a normal embedding rj u then C is 
critical, 1(C) = J(C), and #J(C) = d(r] u ) + 2D( Vu ) - 1. 

Proof. Every descent is an MSI of C by the Descent Lemma, so any other MSI must be 
contained in a plateau by minimality. In fact, we claim that any plateau C(vi, Vj) is an MSI. 
Without loss of generality we can assume Vi = w (since otherwise is a descent and so no 
MSI can contain it) and Vj = u. 

To show C = C(w,u) is a skipped interval, first note that by construction 1(C) consists 
of c repeated k = j — i > 2 times, so w(c) = rj u (c) + k. Thus by normality and the fact that 
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k > 2 we get ry u (c) = and w(c) = k. Using normality again implies that c cannot be the 
first element in its run of fc's in w, and thus w(c — 1) = k. Because of this, there is a chain 
C from w to u all of whose labels are c — 1. By construction C < C and C H C" = 0, so C 
is a MSI as desired. 

To show the plateau is minimal suppose, to the contrary, that there is a skipped interval 
I C C and let w = v , u = Vj. Note that because \v \ — \vi\ — . . . — \vj_x\, the Same Length 
Lemma applies to show that there is only one chain (namely an interval of C) between any 
two of these compositions. Thus the chain C giving rise to / must rejoin C at an embedding 
rj' u of u into w, and hence also contain v\ in order to cut out a proper subinterval. From this 
and normality of r\ u we have 



Also, \u\ = \w\ — \ implies that C must zero out exactly one element of w. Since C < C, 
that element must be in a position strictly to the left of position c. But then because r] u and 
r]' u are both expansions of u we are forced to have r] u (c — 1) = r]' u (c), contradicting (13). 

Now we know that 1(C) consists of the descents and plateaus of C which are disjoint 
and cover C by their definition, so 3(C) = 1(C) and C is critical by Theorem 3.2. To count 
the number of MSI's, note that if c is a position counted by d(r] u ) then the vertex just before 
the edge labeled c in C will be a descent, unless that edge is the very first one. On the other 
hand, if c is counted by D(rj u ) then the run of c's in 1(C) contribute both a plateau and a 
descent just before the plateau to X (unless the run is at the beginning of C when only the 
plateau will be an interval). In this manner we count each MSI exactly once for a total of 
d(r) u ) + 2D(i] u ) — 1 intervals. □ 

Proposition 4.6. IfC is weakly decreasing and ends at an embedding rj u which is not normal 
then C is not critical. 

Proof. As in the proof of the previous proposition, it suffices to consider the case where 1(C) 
consists of a label c repeated k > 2 times so that w(c) = r] u (c) + k. If r/„(c) > then |u?| = \u\ 
and so the Same Length Lemma applies to show that C is the only chain from w to c. In 
particular, it is the lexicographically first chain and thus not critical. 

If rj u (c) = then w(c) = k and, since r] u is not normal, it must be that c is the first index 
in this run of fc's in w. Let 



To demonstrate that C is not critical, it suffices to show that there is no MSI / containing 
the element V\ in C . Suppose, to the contrary that such an interval / exists and let C be 
a chain giving rise to /. Using the Same Length Lemma as in the proof of Proposition 4.5 
(third paragraph) we see that C must rejoin C at u and this forces / = C(w,u). 

To finish the proof, it suffices to find a skipped interval I" C / since that will contradict 
the minimality of /. Let b be the smallest label in 1(C). Then b < c since 1(C) < 1(C) = (c k ). 
Now using the same argument as at the end of the third paragraph of the previous proposition 
as well as equation (14) gives 



n' u (c) < k = w(c - 1) = rju(c - 1). 



(13) 



r] u (c — 1) = w(c — 1) = h ^ k. 



(14) 



V'u(c) = Vu(c -l) = h^k = w(c). 
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Since the parts of a composition can only (weakly) decrease along a chain, we must have 
f]' u {c) < w(c). It follows that c is a label on C . Now consider any permutation of 1(C) which 
starts I" = (c,b, ...) and let C" be the chain specified by I". Then by construction and the 
Chain Specification Lemma l(C") < I" < 1(C). This implies that C" < C in PL-order and, 
by construction again, C" contains V\. Thus no MSI of C can contain V\, a contradiction. □ 

Our third and final proposition completes the proof of Theorem 4.4 

Proposition 4.7. If C is not weakly decreasing then C is not critical. 

Proof. If C is not weakly decreasing then it has an ascent v. It suffices to show that v is in 
no MSI. Suppose, to the contrary, that v is in an MSI / = C(vi,Vj). Then by the Descent 
Lemma, / contains no descents and so C is weakly increasing from Vi to Vj. But as in the 
previous two proofs, it is no loss of generality to assume vi = w and Vj = u so that C is itself 
an MSI. 

Since C is an MSI, it is not the first chain in [u, w]. That first chain is the unique weakly 
increasing chain which ends at the rightmost embedding p u of u into w. Thus if r] u is the 
embedding defined by C then we must have i] u ^ p u . 

For rj u define 

z v (j) = #{* < j | Vu(i) = 0} 

and similarly define z p (j) for p u . Because p u is rightmost we always have z p (j) > z v (j) 
with equality when j = \w\. But p u is not equal to i] u , so there is an index a such that 
z p (a) > z^a). Thus there is a first index c > a such that z p (c) = z v (c). This definition of c 
forces r] u (c) = and p u (c) = k for some k > 0. But rj u and p u are embeddings of the same 
composition, so there must be some index b with a < b < c such that r/ u (b) = p u (c) = k and 
Vu(i) = for b < i < c. 

We can now derive a contradiction by constructing a smaller skipped interval in C as 
follows. We have w(c) > p u (c) = k and r] u (c) = 0. Since C is weakly increasing, the labels 
equal to c must occur as a plateau. Therefore there must be vertices w',u' 6 C such that 
/' = C(w',u') satisfies r] w i(c) = k, r] u i(c) = 0, and /(/') = (c fc ). But 



Vw'W = Vu>{i) = Vu{i) 



k if i = b, 
if b < i < c, 



so there is a chain C from w' to u' with 1(C) = (b k ). Since b < c, /' is a skipped interval 

and we have obtained the desired contradiction. □ 

We can now rederive the formula for p(u,w) in P*. Combining equation (10) with The- 
orems 3.1, 3.2, and 4.4 we obtain 



p( U ,w) = x(u, W ) = j2(-if imac = j2(-i) dM+2DM ~ 2 = 



d(ri u ) 



where the first sum is over all critical chains C in (u, w) and the other two are over all normal 
embeddings r\ u into w . 

We end this section by remarking that the Morse method can be used as a powerful tool 
not just for proving theorems but for discovering the correct statement to be proved. The 
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reader may have found our definition of a normal embedding somewhat ad hoc. However, by 
starting with the very natural chain labeling used above and looking at the critical chains, 
one is quickly led to this definition in order to characterize the embeddings at which such 
chains end. Similarly, the defect may seem to have come out of nowhere, but in order to 
determine the dimension of the critical cells one is forced to define this quantity as well as 
its big brother D(j] u ). 

5 Generalized subword order 

We can now generalize both our result and Bjorner's as follows. Let (P, <p) be any poset. 
Generalized subword order is the partial order on P* obtained by saying that u <p* w if w 
contains a subsequence w(ii), 10(^2), • • • , such that u(j) <p w{ij) for 1 < j < I — \u\. 
We get ordinary subword order when P = A is an antichain and we get the composition 
poset when P = P. 

It is a simple matter to recast this generalized order in terms of embeddings. Let be 
a special element which is not in P and let P be the poset obtained by adjoining as a 
minimum element, i.e., <p x for all x G P. Then the definitions of support and expansion 
are as usual, just replacing with 0. An embedding of u into w is a length \w\ expansion rj u 
of u with 

Vu(i) <p w(i) for 1 < i < \w\. 

As expected, u <p* w if and only if there is an embedding of u into w. 

Finding an analogue of normality in this context is more delicate. So far, we have only 
been able to do it for a special class of posets. But there is evidence that more general 
results are possible; the next section contains a discussion of this issue. First note that the 
definition of a run carries over verbatim to any P*. Now call P a rooted tree if its Hasse 
diagram is a tree with a minimum element. A rooted forest is a poset where each connected 
component of its Hasse diagram is a rooted tree. Note that both antichains and chains are 
rooted forests. Note also that if P is a rooted forest then P is a rooted tree so the following 
definition makes sense. If x G P where P is a rooted forest then let x~ be the element 
adjacent to x on the unique path from x to in the Hasse diagram for P. For a rooted 
forest, a normal embedding of u into w is an embedding r\ u into w satisfying two conditions. 

1. For 1 < i < \w\ we have r] u (i) = w(i), w(i)~, or 0. 

2. For all x G P and every run [r, t] of x's in w, we have 

(a) (r, t] C Supp r] u if x is minimal in P, 

(b) r G Suppr^ otherwise. 

Finally, we nee the definition of defect in this situation, which is as expected: 

d(Vu) = #{i I Vu(i) = w(i)~} 

for a normal embedding i] u into w. The following theorem is the promised generalization of 
Theorems 1.1 and 1.2. Both of the two proofs we have given of the special case where P = P 
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generalize easily, with the minimal elements playing the role of 1 and the rest functioning 
like the integers k > 2. 

Theorem 5.1. Let P be a rooted forest. Then the Mobius function of P* is given by 

M^) = ]T(-ir, 

where the sum is over all normal embeddings r\ u of u into w. □ 

6 Comments and open problems 

There are several possible avenues for future research. We discuss some of them here. 

6.1 Generating Functions 

As mentioned in the introduction, Bjorner and Reutenauer [6] gave another proof of the 
formula for jj, in A* using generating functions on monoids. Let Z*((A)) denote the algebra 
of formal series using the elements in A as noncommutative variables and the integers as 
coefficients. Such a series can be written 

/ = ^2 c w w 

weA* 

for certain c w G Z. For example, given u G A* one can consider the series 

m(u) = ^2\ W ) w - (15) 

Bjorner and Reutenauer showed that (15) is rational for any u and obtained, upon special- 
ization of the variables, nice expressions for various ordinary generating functions associated 
with the Mobius function of A*. They also derived results for the zeta function of A*. The 
map m : A* — > Z((A)) can be extended to a continuous linear endomorphism of Z((A}}. In 
fact, the full incidence algebra of A* is isomorphic to a subalgebra of this endomorphism 
algebra. Bjorner and Reutenauer give another proof of Theorem 1.1 using this fact. 

It is natural to try and apply these ideas to P*, and more generally to rooted forests. 
This has been done by Bjorner and Sagan [7]. 

6.2 The poset of permutations 

Our original interest in P* came from the rapidly growing subject of permutation patterns. 
For an overview of permutation patterns the reader is referred to Bona's text [9]. Let S n 
denote the nth symmetric group and let it G S n and a G Si. We say that % contains a 
a-pattern, and write it > a, if there are indices i\ < %i < ■ ■ ■ < %i such that the subsequence 
TT(i\)ir{i2) . . . tt{%x) has the same pairwise comparisons as a(l)a(2) . . . o~(l). This subsequence 
is called a copy of a in it. For example, 312 < 24153 because of the copy 413. This is a 
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31524 




Figure 3: The Hasse diagram for the interval [1, 31524] in the pattern containment ordering 
on permutations 

partial order on the set of all finite permutations. Wilf was the first to ask the following 
question. 

<C To-do #1: It seems to me that our definition of pattern doesn't need a one- 
line notation warning, although our example 312 < . . . might. 

Question 6.1 (Wilf [26]). What can be said about the Mobius function of permutations 
under the pattern- containment ordering? 

Given two permutations it G S m and a G S n , their direct sum is the permutation of length 
m + n whose first m elements form a and whose last n elements are the copy of n gotten by 
adding m to each element of ir. For example, 132©32145 = 13265478. A permutation is said 
to be layered if it can expressed as the direct sum of some number of decreasing permutations. 
(An equivalent characterization of layered permutations is that they are the permutations 
that contain neither a 231-pattern nor a 312-pattern.) Our previous example is layered 
because 13254378 = 1 © 21 © 321 © 1 © 1. Clearly the set of layered permutation of length 
n is in bijection with the set of compositions of n. Almost as clearly, this bijection sends 
the pattern-containment order to the composition order we have considered, so Theorem 1.2 
answers Wilf's question for the set of layered permutations. 

Any normal embedding approach to describing the Mobius function for permutations in 
general must incorporate non-unitary weights, as witnessed by the fact that fi(l, 31524) = 6. 

6.3 Factor order 

Subword order is not the only partial order on the set of words. We say that the word u is 
a factor of the word w if there exist (possibly empty) words v\ and w 2 so that w = v±uv2, 
or in other words, if u occurs as a contiguous subword in w. Bjorner [5] showed that the 
Mobius function for factor order only takes on values in {0, ±1} and gave a recursive rule 
that allows the computation of fj,(u,w) in 0(|w| 2 ) steps. 
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a b 



Figure 4: The Hasse diagram for the poset A 

The factor order can be defined on P* for any poset P: we say that u is a factor of w if 
there are words Vx,v 2 , v 3 such that: 

1. W — ViV 2 V 3 , 

2. |l! 2 | = \u\, 

3. u{i) < V2{i) for all 1 < i < 

Indeed, this is one of the orders on P* studied by Snellman [20, 21]. The Mobius function of 
P* under factor order remains unknown. 

6.4 Subwords over A 

The smallest poset to which Theorem 5.1 is inapplicable is the poset A depicted in Figure 4. 
Still, the Mobius function of A* seems to be quite interesting. In fact, numerical evidence 
points to a surprising connection with the Tchebyshev polynomials of the first kind, T n (x), 
which can be defined as the unique polynomials such that 

T n (cos8) = cos(n9). 

Conjecture 6.2. For all i < j, n{a\c^) is the coefficient of x^~ l in T i+ j(x). 

As with the poset of permutations, a normal embedding interpretation of fi(a l , c l ) would 
need to use weights because, for example, //(a, cc) = —3. 

One possible way to attack this conjecture is to use the three-term recurrence for T n (x). 
Translating this in terms of the conjecture, it would suffice to show that 

fi{a\ c>) = 2/i(a\ c 7 ' -1 ) - ^(a i_1 , c 7 " 1 ) 

for j > i > 1 However, we have not been able to see any relationship between the intervals 
[a\ c J ], [a\ c J_1 ], and [a l ~ 1 , c- 5-1 ] which would permit us to derive this relation for their Mobius 
functions. 

There are two closely related areas where the Tchebyshev polynomials have appeared. 
A permutation n avoids a permutation a if it doesn't not contain a a-pattern. Chow and 
West [10] showed that the generating function for the number of elements in S n avoiding 
both 132 and 12 ... k for fixed k can be expressed in terms of Tchebyshev polynomials of 
the second kind. Mansour and Vainshtein [18] extended this result to count permutations 
avoiding 132 and containing exactly r copies of 12 ... k. 
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More recently, Hetyei [16] defined poset maps T and U which he called Tchebyshev 
transformations of the first and second kind. This is because when applied to the ladder 
poset L n , the cd-index of the images can be expressed in terms of T n (x) and U n (x). Since 
the cd-index is related to the Mobius function, it is conceivable that Hetyei's map could 
be used to prove our conjecture. But the posets T(L n ) are not isomorphic to any of our 
intervals [a^c 7 ] in general, so it is not clear how to proceed. However, these maps are very 
interesting in their own right and have been further studied by Ehrenborg and Readdy [11]. 

Acknowledgment. We are indebted to Patricia Hersh for useful discussions and references. 
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